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ABSTRACT 


To  save  storage,  a  program  is  usually 
written  so  that  each  variable  assumes 
several  values.  As  a  result,  a  pro¬ 
gram  is  usually  difficult  to  understand 
and  prone  to  errors.  For  an  important 
class  of  programs,  it  will  be  shown  that 
they  can  be  written  with  complete  free¬ 
dom  in  the  naming  of  variables;  leaving 
the  task  of  minimizing  storage  require¬ 
ments  to  the  computer  itself. 


Optimal  Assignment  of  Computer  Storage  by  Chain 
Decomposition  of  Partially  Ordered  Sets 

by 

George  B.  Dantzig  and  Gary  H.  Reynolds 


THE  PROBLEM:  Given  a  sequence  of  n  steps,  on  the  k-th  step  a  value  is 

computed  as  a  function  F^  of  the  previously  computed  values  vk  ^ 

In  general,  only  a  subset  of  these  is  required  to  compute  .  We  are  inter¬ 
ested  in  finding  the  minimal  number  of  locations  in  the  memory  of  the  computer 
to  store  the  values  v.  so  that  they  will  be  available  for  computing  the  suc¬ 
cessive  functions  .  A  saving  in  the  number  of  locations  occurs  whenever  a 
value  is  stored  in  the  same  location  as  a  previously  computed  .which 

is  no  longer  needed  for  step  k+1 ,  k+2,...,  n  . 

MOTIVATION:  Consider  the  following  trivial  example: 


Vj  *  given 
*  given 


v2  +  v3 


5. 


v3  +  v4 


6.  Output  =  Vj 


The  arrows  in  the  figure  indicate  the  node  values  needed  to  compute  ,  Vj,... 
Thus  Vj  requires  and  Vj  ,  etc.  The  logical  dependence  is,  accordingly. 
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v j  *  Fj (constant  given) 
V2  *  F2(constant  given) 

v3  *  F3^v1 ’  v2^ 

v4  *  F4^v2’  v3^ 

v5  *  F5^v3*  v4^ 

Output  *  F<;(vc) 


A  programmer  who  is  a  memory  miser  would  notice  the  dependence  of  one  step  on 
another  and  would  store  in  the  same  location  as  (since  Vj  is  not  needed 

after  step  3) »  in  the  same  location  as  V2  ,  and  v,.  in  the  same  location  as 
Vj  .  Only  two  locations  are  needed  which  he  calls,  say,  "x"  and  "y'1.  He  accordingly 
writes  the  following  program: 

Ste£ 

1 .  x  *  given  input 

2.  y  *  given  input 

3.  x  *  x+y 

k.  y  =  y+x 

5.  x  =  x+y 

6.  output  x 

or  some  such  nonsense,  which  we  will  refer  to  as  "memory  misering  algebra." 

This  multiple  use  of  the  same  symbol  is  a  recognized  cause  of  program  error. 

It  is  one  of  the  primary  reasons  why  one  programmer  has  the  greatest  difficulty  in 
understanding  a  program  written  by  another  (or  even  one  by  himself).  To  avoid  mul¬ 
tiple  use  of  the  same  symbol  for  memory  misering,  a  programmer  can  make  use  of  spe¬ 
cial  instructions  which  will  direct  the  machine  language  compiler  to  store  the  values 
of  different  symbols  in  the  same  location.  This  is  of  some  help,  but  leaves  the 
task  of  conserving  storage  location  up  to  the  programmer  and  again  is  subject  to 
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error. 

Our  thesis  is  that  memory  misering  5>  essentially  clerical  in  nature,  a  task 
unworthy  of  the  programmer's  time.  We  will  show  for  one  important  class  of  pro¬ 
grams  that  the  task  of  conservation  of  memory  location  can  be  done  efficiently  by 
the  machine  as  part  of  its  translation  of  a  program  into  machine  language. 

SOLUTION:  Define  for  each  an  interval  of  storage.  If  is  last  needed  to 

compute  ,  then  its  interval  of  time  for  storage  is  from  step  k+1  to  l 

and  is  denoted  by 

Kvk)  *  [k+1,  £k] 

We  define  an  interval  l(vk)  as  coming  before  another  interval  l(vk,)  when 
-tk  <  k'+l  ,  which  we  write  as 

»(vj)  <  Kvk.)  if  £k<k'+l 


The  set  of  intervals  forms  a  partially  ordered  set  under  this  ordering  relation. 

It  is  obviously  transitive.  No  ordering  is  given  between  two  overlapping  intervals; 

such  intervals  are  said  to  be  unrelated.  A  subset  of  intervals  l(v.  ),  l(v.  ),... 

J1  J2 


,l(v.  )  is  said  to  be  completely  ordered  if 

j  s 


I  (v.  )  <  I  (v.  )<...<  I  (v.  ) 
J1  J2  Js 


We  will  refer  to  such  a  completely  ordered  subset  as  a  chain.  Obviously,  values 

v.  ,  v.  , . . . , v.  associated  with  the  intervals  in  a  chain  may  all  be  stored  in 
J1  J2  J  s 

the  same  storage  location. 

The  problem  of  finding  the  minimal  number  of  storage  locations  is  thus  the  same 
as  that  of  decomposing  a  partially  ordered  set  into  disjoint  subsets,  each  of  which 
is  completely  ordered.  This  is  called  a  chain  decomposition.  A  constructive  pro¬ 
cedure  for  doing  this  is  given  by  one  of  the  authors,  joint  with  Alan  Hoffman  [1], 


J* 


i* 


«m  connect  I OH  with  Di "worth's  Theorem  [2j.  in  Our  Special  application  here 
to  the  partially  ordered  set  of  intervals,  there  is  available,  however,  a  much 
easier  procedure.  This  can  be  found  in  Ford  and  Fulkerson  [3].  Applied  here,  it 
yields: 


RULE:  Store  in  the  same  location  as  any  y.  not  needed  for  any  step  after  k. 


It  is  obvious  that  the  application  of  the  rule  provides  a  valid  storage  pro¬ 
cedure  and  it  is  probably  equally  obvious  that  the  rule  yields  a  minimal  number  of 
storage  locations.  We  will,  nevertheless,  give  a  formal  proof. 

Up  to  step  k  ,  let  ^  be  the  subset  of  locations  used  to  store  t!'e  values 
Vj ,  V2,...,V|<  1  .  Let  L  be  any  location  in  the  set  j  ,  and  v.  the  last  value 
stored  in  L  at  the  start  of  step  k  .  Several  values  may  have  previously  been 
stored  in  L  ,  but  v.  refers  only  to  the  last  ^ne  stored  in  L  up  to  step  k  ; 
let  1^  be  the  storage  interval,  of  this  v.  . 

It  is  clear  that  cannot  be  stored  in  L  if  1^  overlaps  with  I (v^)  . 

If  l(vk)  overlaps  with  every  interval  1^  for  all  LeTk_^  ,  then  it  is  necessary 

to  increase  the  set  of  storage  locations  in  order  to  store  vk  .  In  this  case,  the 

number  of  storage  locations  in  has  to  be  one  greater  than  Nk_j  ,  the  number  of 

locations  in  .  In  general,  Nk  “  1  +  Nk-1  or  Nk  =  \_i  *  Let  us  suPPose 

that  on  step  k  ,  there  was  a  location  LeTk_j  such  that  I (L)  does  not  overlap 

with  l(vk)  ,  but  that  a  location  L  not  in  Tk_j  was  used  instead  for  storing 

vk  .  Note  that  on  subsequent  steps  the  vaiues  stored  in  L  or  L  cou*d  be  *nter~ 
changed  if  on  step  k  ,  location  L  were  used  in  place  of  L  .  This  interchange 
never  increases  the  count  of  the  locations  used  and  the  count  could  even  be  de¬ 
creased  if  L  is  never  used  and  is  dropped. 

Thus  we  have  shown  that  there  always  exists  a  minimal  storage  selection  that 
always  stores  for  each  k  the  value  vk  in  T(<_j  unless  1^  for  all  LeTk_j 
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m/A  rl  a  ne 
«-»*«*•  l  w^w 


Uit  f  U 

ri  i  w  •  • 


i/..  N 

’"V 


We  wish  to  show  that  any  selection  with  this  property 


is  minimal.  Let  k  *  k  be  the  lowest  index  k  such  that  N  *  ■  Max  N.  ;  then 

k  K 

I.+  N.  •  N  .  .  Thus  every  interval  I.  for  LeT  overlaps  with  that  of 
k-1  k  L  k  -1 

I (v  .  But  each  such  interval  beqins  before  I (v  *,)  ,  hence  ail  overlap  with 
k  ’  k 

the  value  k*+  1  ,  the  start  of  interval  l(v  .  Thus  all  N  *  intervals  of 

k  k' 

T  ^  have  the  value  k  +  1  in  common,  and  constitute  a  set  of  N  ^  unrelated 
k  k 


intervals  in  the  partially  ordered  set  of  intervals. 

Note  that  N  *  happens  to  be  also  equal  to  the  number  of  storage  locations 
k 

selected  to  carry  out  the  computations.  Associated  with  each  location  LeT  *  is 

k 

the  subset  of  values  v?  stored  in  L  on  steps  l,2,...,n  .  The  intervals 
I (vj)  of  these  v}  are  completely  ordered,  hence  form  a  chain. 

Thus  for  each  LeT  ^  ,  there  is  associated  a  mutually  exclusive  chain,  and 

k 

every  interval  in  the  original  partially  ordered  set  belongs  to  one  of  these 

chains.  Thus  we  can  decompose  the  partially  ordered  set  into  N  *  non-over- 

k 

lapping  chains.  Since  it  is  obvious  that  each  member  of  any  group  of  unrelated 

elements  must  belong  to  different  chains,  the  minimum  chain-decomposition  must 

always  be  greater  or  equal  to  the  maximum  number  of  unrelated  elements.  Hence, 

when  N  ,  the  number  of  chains  in  some  decomposition,  happens  vo  be  the  same 
k 

as  the  number  of  elements  In  some  set  of  unrelated  elements,  we  conclude  that 
this  can  only  occur  when  the  partially  ordered  set  has  been  decomposed  into  a 
minimal  number  of  chains.  This  completes  our  proof.  The  discussion  just  given 


Is  a  paraphrase  of  the  usual  proof  of  sufficiency  of  the  following: 
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DjLWOP.TH'S  THEOREM:  The  maximum  number  of  unrelated  elements  in  a  partially  or¬ 
dered  set  is  equal  to  the  number  of  chains  in  a  minimal  decomposition. 

APPLICATION  IF  THE  NUMBER  OF  STEPS  IS  SHALL:  The  task  of  the  compiler  will  be  to 
set  up  a  correspondence  between  location  addresses  and  symbols  used  in  the  program. 

If  there  are  n  steps  and  n  is  reasonably  smal I ,  then  the  following  pro¬ 
cedure  will  accomplish  the  minimum  storage  of  the  program.  Only  if  the  program  is 
to  be  executed  many  times  would  the  method  to  save  storage  given  below  be  worthwhile 
Set  aside  n  locations  Ak  for  recording  l ,  the  last  step  for  which  vk 

is  needed  for  computation.  Scan  each  step  l  in  turn  and  record  l  in  Ak  *f 

vk  *s  required  on  step  t  to  compute  .  The  final  value  of  t  recorded  in 
each  Ak  is  •  Note  that  for  any  v.  which  is  not  required  on  some  subsequent 

step  (such  as  v  )  the  value  ?n  A.  is  l.  =  0  . 

n  ii 

Set  up  a  way  of  generating  the  names  of  up  to  n-1  addresses  which  will  be 
called  upon  as  required  as  a  source  of  additional  addresses  for  storing  v.  .  The 

addresses  to  be  assigned  for  storing  v;  will  be  stored  in  n  locations 

B^f  B2»...,Bn  as  follows:  Generate  an  address  and  store  in  B^  ,  except  store  0 

if  Ly  -  0  .  For  each  k  =  1,2 .  n  ,  store  0  in  Bk  if  ~  0  \  otherwise, 

the  same  address  as  in  B.  where  i  is  the  first  i_  <  k  such  that  0  <  l.  <  k  . 

'0  0  0  »(T 

If  there  exists  no  such  iQ  ,  then  generate  a  new  location  address  and  store  it  in 

Bk  .  Note  that  0  in  Bk  is  to  be  interpreted  as  not  requiring  ar.  address  for 

Vi,  .  To  prevent  the  re-use  of  l.  (since  it  is  now  superceded  by  l.),  the  value 
k  i0  k 

of  t.  in  A.  is  replaced  by  "+»"  and  the  process  is  then  iterated.  Finally, 

'o  'o 

assign  the  address  in  B.  to  Vj  • 


IF  THE  NUMBER  Or  STEPS  iS  LARGE;  A  simple  example  will  suffice  to  show  a  funda¬ 
mental  difficulty  of  the  previous  procedure  when  the  number  of  steps  is  large  or 
unspecified  until  execution  time.  The  following  rou* ine  (assuming  no  mistakes) 
can  be  used  to  (inefficiently)  rearrange  m  numbers  in  ascending  order: 

Input  [Xj  j ,  x 1 2 . x]m} 

For  i  =  2 ,3  ....  .m 

Ail  =  X?-J ,1  * 

for  j  =  I ,2, . . . , (m-1 ) 

*i,j  =  MinCAi,j  • 

Ai,j+i  =  MaX*'Ai ,  j  ’  xi-l,j+l^ 

x.  =  A. 
i  ,m  i  ,m 

Output  fxmj  >  .  •  •  •  iX^)  . 


We  will  call  this  a  gener ic  algorithm  because  m  is  not  specified  until  execution 

time.  Here  we  wish  to  make  a  prior  decision  of  what  values  are  to  be  stored  in  t\e 

same  location  to  be  used  whatever  be  the  eventual  value  of  m  .  This  particular 

2 

routing  computes  2m  +  m  different  values.  For  m  =  1,000,  say,  it  would  not  be 

practical  to  apply  the  method  of  the  previous  section.  A  little  study  shows  that 

all  the  A.j  may  be  stored  in  a  single  location  and  all  the  vectors 

fx.  , ,  x.  - , . . . ,x.  }  in  the  same  m  locations  as  fx.  x.  .  }  .  Hence 

*■  i,l  i,2  i  ,mJ  1  i-l  ,1  i-l  ,rnJ 

only  m+1  memory  locations  are  required  by  this  routine  to  sort  m  numbers. 

This  illustrates  the  more  important  problem  which  we  are  working  on,  namely  that 
of  analyzing  the  structure  of  generic  routines  (i.e.,  those  with  unspecified  para¬ 
meters)  to  determine  the  minimal  assignment  to  storage  prior  to  specification. 
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